Topological solution of missing attribute values problem in incomplete information tables
Abstract
In this paper, we present a data decomposition method that avoids the need to reason on data with missing attribute values. We define a general binary relation on the original incomplete data, which is decomposed into data subsets without missing values. Next, a topological base relation method for classifier induction is applied to these subsets. A new approach to finding the missing values in consistent data is discussed. New pretopological approximations of concepts are studied, and some of their properties are proved. Pretopological measures are introduced for incomplete information systems, and further pre-approximations are defined and studied. Finally, the reducts and core are determined.
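To make the idea of approximating a concept under a general binary relation concrete, the following is a minimal sketch only. It does not reproduce the paper's pretopological construction, which the abstract does not specify. It assumes a tolerance-style relation often used for incomplete tables (two objects are related when they agree on every attribute where both values are known) and computes neighborhood-based lower and upper approximations. The names `MISSING`, `tolerant`, `neighborhoods`, and `approximations`, and the toy table, are all hypothetical.

```python
MISSING = None  # assumed marker for a missing attribute value


def tolerant(x, y):
    """True when two rows agree on every attribute where both values are known."""
    return all(a == b or a is MISSING or b is MISSING for a, b in zip(x, y))


def neighborhoods(table):
    """Map each object index to the set of object indices related to it."""
    return {
        i: {j for j, y in enumerate(table) if tolerant(x, y)}
        for i, x in enumerate(table)
    }


def approximations(table, concept):
    """Lower and upper approximation of `concept` (a set of object indices)."""
    nbr = neighborhoods(table)
    # Lower: objects whose whole neighborhood lies inside the concept.
    lower = {i for i, n in nbr.items() if n <= concept}
    # Upper: objects whose neighborhood meets the concept.
    upper = {i for i, n in nbr.items() if n & concept}
    return lower, upper


# Toy incomplete table with two attributes per object (illustrative data only).
table = [
    ("high", "yes"),
    ("high", MISSING),
    ("low", "no"),
    (MISSING, "no"),
]
lower, upper = approximations(table, {0, 1})
print(lower, upper)
```

For the concept `{0, 1}`, only object 0 has a neighborhood fully contained in the concept, while objects 0, 1, and 3 each have a neighborhood that intersects it. The gap between the two sets is the kind of uncertainty that missing values introduce.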
Similar resources
A Rough Set Model Based on Probabilistic Similarity Measure for Incomplete Decision Tables
Rough set models in incomplete decision tables have been discussed so far. Numerous approaches to deal with missing values in incomplete information systems have been proposed. In this paper, assuming that the domain of attribute values is defined, we apply the probability of values appearing in data tables in order to measure the self-information of similarity. This is defined as the uncertain...
Rough Set Approaches to Rule Induction from Incomplete Data
In this paper we assume that data are presented in the form of decision tables, incomplete when some attribute values are missing. Two main cases of missing attribute values are considered: lost (the original value was erased) and "do not care" conditions (the original value was irrelevant). This paper uses, as the main tool, attribute-value pair blocks. These blocks are used to construct chara...
Data with Missing Attribute Values: Generalization of Indiscernibility Relation and Rule Induction
Data sets, described by decision tables, are incomplete when for some cases (examples, objects) the corresponding attribute values are missing, e.g., are lost or represent “do not care” conditions. This paper shows an extremely useful technique to work with incomplete decision tables using a block of an attribute-value pair. Incomplete decision tables are described by characteristic relations i...
A BAYESIAN APPROACH TO COMPUTING MISSING REGRESSOR VALUES
In this article, Lindley's measure of average information is used to measure the information contained in incomplete observations on the vector of unknown regression coefficients [9]. This measure of information may be used to compute the missing regressor values.
A Comparative Analysis of Multigranular Approaches and on Topoligical Properties of Incomplete Pessimistic Multigranular Rough Fuzzy Sets
Rough sets, introduced by Pawlak as a model to capture impreciseness in data, have been a very useful tool in several applications. These basic rough sets are defined by taking equivalence relations over a universe. In order to enhance the modeling power of rough sets, several extensions to the basic definition have been introduced over the past few years. Extending the single granular structure...
Journal: Inf. Sci.
Volume: 180
Publication date: 2010